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Escape Regions of the Active Target Defense 

Differential Game 

Eloy Garcia, David W. Casbeer, and Meir Pachter 


Abstract 

The active target defense differential game is addressed in this paper. In this differential game an 
Attacker missile pursues a Target aircraft. The aircraft is however aided by a Defender missile launched 
by, say, the wingman, to intercept the Attacker before it reaches the Target aircraft. Thus, a team is 
formed by the Target and the Defender which cooperate to maximize the separation between the Target 
aircraft and the point where the Attacker missile is intercepted by the Defender missile, while the 
Attacker simultaneously tries to minimize said distance. This paper focuses on characterizing the set of 
coordinates such that if the Target’s initial position belong to this set then its survival is guaranteed if 
both the Target and the Defender follow their optimal strategies. Such optimal strategies are presented 
in this paper as well. 


I. INTRODUCTION 

In multi-agent pursuit-evasion problems one or more pursuers try to maneuver and reach a 
relatively small distance with respect to one or more evaders, which strive to escape the pursuers. 
This problem is usually posed as a dynamic game (H, (2J, [[3J. Thus, a dynamic Voronoi diagram 
has been used in problems with several pursuers in order to capture an evader within a bounded 
domain [O, 0. On the other hand, [(5]| presented a receding-horizon approach that provides 
evasive maneuvers for an Unmanned Autonomous Vehicle (UAV) assuming a known model of 
the pursuer’s input, state, and constraints. In [|6]|, a multi-agent scenario is considered where a 
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number of pursuers are assigned to intercept a group of evaders and where the goals of the 
evaders are assumed to be known. Cooperation between two agents with the goal of evading a 
single pursuer has been addressed in 0 and 0. 

In this paper we consider a zero-sum three-agent pursuit-evasion differential game. A two-agent 
team is formed which consists of a Target (T) and a Defender ( D ) who cooperate; the Attacker 
{A) is the opposition. The goal of the Attacker is to capture the Target while the Target tries to 
evade the Attacker and avoid capture. The Target cooperates with the Defender which pursues 
and tries to intercept the Attacker before the latter captures the Target. Cooperation between the 
Target and the Defender is such that the Defender will capture the Attacker before the latter 
reaches the Target. Such a scenario of active target defense has been analyzed in the context of 
cooperative optimal control in IflOll . Indeed, sensing capabilities of missiles and aircraft allow 
for implementation of complex pursuit and evasion strategies [[TO . [fl2l . and more recent work has 
investigated different guidance laws for the agents A and D. Thus, in llT3l the authors addressed 
the case where the Defender implements Command to the Line of Sight (CLOS) guidance to 
pursue the Attacker which requires the Defender to have at least the same speed as the Attacker. In 
llT4l the end-game for the TAD scenario was analyzed based on the minimization/maximization 
of the Attacker/Target miss distance for a non-cooperative Target/Defender pair. The authors 
develop linearization-based Attacker maneuvers in order to evade the Defender and continue 
pursuing the Target. A different guidance law for the Target-Attacker-Defender (TAD) scenario 
was given by Yamasaki et.al. IH51 . [fl6l . These authors investigated an interception method called 
Triangle Guidance (TG), where the objective is to command the defending missile to be on the 
line-of-sight between the attacking missile and the aircraft for all time, while the Target aircraft 
follows some predetermined trajectory. The authors show, through simulations, that TG provides 
better performance in terms of Defender control effort than a number of variants of Proportional 
Navigation (PN) guidance laws, that is, when the Defender uses PN to pursue the Attacker 
instead of TG. 

The previous approaches constrain and limit the level of cooperation between the Target and 
the Defender by implementing Defender guidance laws without regard to the Target’s trajectory. 

Different types of cooperation have been recently proposed in fTTTl , HU, GUI, [|20l , [f2TTl . 
If22l . If23l for the TAD scenario. In [fT9l optimal policies (lateral acceleration for each agent 
including the Attacker) were provided for the case of an aggressive Defender, that is, the Defender 
has a definite maneuverability advantage. A linear quadratic optimization problem was posed 
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where the Defender’s control effort weight is driven to zero to increase its aggressiveness. The 
work lt20l provided a game theoretical analysis of the TAD problem using different guidance 
laws for both the Attacker and the Defender. The cooperative strategies in 1(271 allow for a 
maneuverability disadvantage for the Defender with respect to the Attacker and the results show 
that the optimal Target maneuver is either constant or arbitrary. Shaferman and Shima ll22l 
implemented a Multiple Model Adaptive Estimator (MMAE) to identify the guidance law and 
parameters of the incoming missile and optimize a Defender strategy to minimize its control 
effort. In the recent paper Il23ll the authors analyze different types of cooperation assuming the 
Attacker is oblivious of the Defender and its guidance law is known. Two different one-way 
cooperation strategies were discussed: when the Defender acts independently, the Target knows 
its future behavior and cooperates with the Defender, and vice versa. Two-way cooperation where 
both Target and Defender communicate continuously to exchange their states and controls is also 
addressed, and it is shown to have a better performance than the other types of cooperation - as 
expected. 

Our preliminary work ll24ll . Il25ll considered the cases when the Attacker implements typical 
guidance laws of Pure Pursuit (PP) and PN, respectively. In these papers, the Target-Defender 
team solves an optimal control problem that returns the optimal strategy for the T — D team so 
that D intercepts the Attacker and at the same time the separation between Target and Attacker at 
the instant of interception of A by D is maximized. The cooperative optimal guidance approach 
was extended ( (1261 . 1(271 , (1281 ) to consider a differential game where also the Attacker missile 
solves an optimal control problem in order to minimize the final separation between itself and the 
Target. In this paper, we focus on characterizing the region of the reduced state space formed by 
the agents initial positions for which survival of the Target is guaranteed when both the Target 
and the Defender employ their optimal strategies. The optimal strategies for each one of the 
three agents participating in the active target defense differential game are provided in this paper 
as well. 

The paper is organized as follows. Section |IT| describes the engagement scenario. Section 11111 
presents optimal strategies for each one of the three participants in order to solve the differential 
game discussed in the paper. The Target escape region is characterized in Section (TV] Examples 
are given in Section [V| and concluding remarks are made in Section [VI] 
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II. PROBLEM STATEMENT 


The active target defense engagement in the realistic plane (x, y ) is illustrated in Figure Q] The 
speeds of the Target, Attacker, and Defender are denoted by Vt, Vi, and Vd, respectively, and 
are assumed to be constant. The simple-motion dynamics of the three vehicles in the realistic 
plane are given by: 


X T — Vp COS 0, 

ijr = Vt sin 0 

(1) 

X A = Va cos X, 

ij A = V A sin x 

(2) 

xd = Vd cos 0, 

Vd = V D sin 0 

(3) 

where the headings of T, D, and A are, respectively, 0, 0 , and 



In this game the Attacker pursues the Target and tries to capture it. The Target and the Defender 
cooperate in order for the Defender interpose himself between the Attacker and the Target 
and to intercept the Attacker before the latter captures the Target. Thus, the Target-Defender 
team searches for a cooperative optimal strategy, optimal headings 0 * and 0 *. to maximize the 
separation between the Target and the Attacker at the time instant of the Defender-Attacker 
collision. The Attacker will devise its corresponding optimal strategy, optimal heading X*. in 
order to minimize the terminal A — T separation/miss distance. Define the speed ratio problem 
parameter a = Vt/Va ■ We assume that the Attacker missile is faster than the Target aircraft, 
so that a < 1. In this work we also assume the Attacker and Defender missiles are somewhat 
similar, so V D — Vi. 

In the following sections this problem is transformed to an aimpoint problem where each 
agent finds is optimal aimpoint. Furthermore, it is shown that the solution of the differential 
game involving three variables (the aimpoint of each one of the three agents) is equivalent to 
the solution of an optimization problem in only one variable. 

III. DIFFERENTIAL GAME 

We now undertake the analysis of the active target defense differential game. The Target 
(T), the Attacker (A), and the Defender (I)) have “simple motion” a la Isaacs lf29l . We also 
emphasize that T, A, and D have constant speeds of Vt, Va, and Vd, respectively. We assume 
that Va = V D and the speed ratio o = < 1. We confine our attention to point capture, that 

is, the D — A separation has to become zero in order for the Defender to intercept the Attacker. 
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Fig. 1. Active target defense differential game, xt > 0 


T and I) form a team to defend from A. Thus, A strives to close in on T while T and D 
maneuver such that D intercepts A before the latter reaches T and the distance at interception 
time is maximized, while A strives to minimize the separation between T and A at the instant 
of interception. Since the cost is a function only of the final time (the interception time instant) 
and the agents have simple motion dynamics, the optimal trajectories of each agent are straight 
lines. 

In Figure [2] the points A and I) represent the initial positions of the Attacker and the Defender 
in the reduced state space, respectively. A Cartesian frame is attached to the points A and D in 
such a way that the extension to infinity of the segment AD in both directions represents the 
X-axis and the orthogonal bisector of AD represents the V'-axis. The state variables are xa, 
xt , and yr- Notice that all points in the Left-Half-Plane (LHP) can be reached by the Defender 
before the Attacker does; similarly, all points in the Right-Half-Plane (RHP) can be reached by 
the Attacker before the Defender does. 

In this paper we focus on the case where the Target is initially closer to the Attacker than to 
the Defender; in other words, assume that xt > 0. 

With respect to Figure [2] we note that the Defender will intercept the Attacker at point / on the 
orthogonal bisector of AD at which time the Target will have reached point T'. The Attacker aims 
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Fig. 2. Active target defense differential game, xt > 0 


at minimizing the distance between the Target at the time instant when the Defender intercepts 
the Attacker, that is, the distance between point T' and point / on the orthogonal bisector of 
AD where the Defender intercepts the Attacker; the points T and T' represent the initial and 
terminal positions of the Target, respectively. 

A. Cost/Payoff Function 

When xt > 0 the Attacker and the Target are faced with a maxmin optimization problem: the 
Target chooses point v and the Attacker chooses point u on the Y-axis, see Figure [3] Additionally, 
the Defender tries to intercept the Attacker by choosing his aimpoint at point w on the Y-axis. 
Thus, the optimization problem is 

max min J(u,v,w). (4) 

v,w u 

where the function J(u. v. w ) represents the distance between the Target terminal position and the 
point where the Attacker is intercepted by the Defender. The target tries to cross the orthogonal 
bisector of AD into the LHP where the Defender will be able to allow it to escape by intercepting 
the Attacker at the point (0. u) on the orthogonal bisector of AD. Therefore, the Defender’s 
optimal policy is w*(u , v) = u in order to guarantee interception of the Attacker. The optimality 
of this choice by the Defender will be shown in Proposition [3 
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Fig. 3. MaxMin Optimization Problem, u > v 


Since the Defender’s optimal policy is w* = u, the decision variables u and v jointly determine 
the distance S between the Target terminal position T' and the point (0, u) where the Attacker 
is intercepted by the Defender. This distance is a function of the decision variables u and v. 
Thus, the Attacker and the Target solve the following optimization problem 

max min J(u,v). (5) 

V U 

Now, let us analyze the possible strategies. If the Target chooses v, the Attacker will respond and 
choose u. If u 7 ^ v the Target would correct his decision and choose some v such that S > S, as 
shown in Figure [3] for the case where u > v and in Figure |4] for the case where u < v. In general, 
choosing u ^ v is detrimental to the Attacker since his cost will increase. Thus, the Attacker 
should aim at the point v which is chosen by the Target, that is, u*(v ) = arg min ?J J(u,v ) = v. 

Proposition 1: Given the cost/payoff function J(u,v), the solution u* and v* of the optimiza¬ 
tion problem max„ min„ ,/(u, v) is such that 

u* = v*. ( 6 ) 

Moreover, when xt > 0, the Attacker strategy is u*(v) = arg min, ( J(a. v) = v so that it suffices 
to solve the optimization problem max s ,/(y) where 

J (■ y) = asjx\ + y 2 - ^(y - y T ) 2 + x 2 T . (7) 
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Fig. 4. MaxMin Optimization Problem, u < v 


B. Critical Speed Ratio for Target Surx’ival 

We assume that the Attacker is faster than the Target, for otherwise the Target could always 
escape without the help of the Defender. Thus, we assume that the speed ratio 0 < a < 1. 
Also, we assume that xt > 0. The Target needs to be able to break into the LHP before 
being intercepted by the Attacker for the Defender to be able to assist the Target to escape, 
by intercepting the Attacker who is on route to the Target. Thus, a solution to the active target 
defense differential game exists if and only if the Apollonius circle, which is based on the 
segment AT and the speed ratio a, intersects the orthogonal bisector of AD. This imposes 
a lower limit a on the speed ratio, that is, we need a < a < 1. The critical speed ratio a 
corresponds to the case where the Apollonius circle is tangent to the orthogonal bisector of AD. 
And if the speed ratio a > 1 the Target always escapes and there is no need for a Defender 
missile, that is, no target defense differential game is played out. The optimal strategies for the 
case x T < 0 can be obtained in a similar way as shown in this paper and the critical speed ratio 
is a = 0. 

Proposition 2: Assume that xt > 0. Then, the critical speed ratio a is a function of the 
positions of the Target and the Attacker and is given by 

yj a + x T ) 2 + Vt~ V( x a ~ %t) 2 + y\ 

a — ---• (oj 

2x a 
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Proof. The Attacker’s initial position, the Target’s initial position, and the center O of the 
Apollonius circle are collinear and lie on the dotted straight line in Figure [2] whose equation is 


Vt . %aVt 

- x H- 


y = - 

Xa — XT Xa — Xt 

The geometry of the Apollonius circle is as follows: The center of the circle, denoted by O, is 

2 

at a distance of from T and its radius is T^sd, where d is the distance between A and 

1 —or 1 —or 5 

T and is given by 


d = sj ( x A ~ x T ) 2 + y\. 


Hence, the following holds 


xtVt 


— —x 0 ) + (xq - x T ) 2 = a [(x A - x T ) 2 + Vt] 
i — xt / 1 — or r 


.xa — xt xa — xt s ' ' (1 — a 2 ) 2 

and we calculate the coordinates of the center of the Apollonius circle 

Xo = jz^xt - X A , Vo = jz^Vt- 


(9) 


( 10 ) 


( 11 ) 


Consequently, the critical speed ratio a is the positive solution of the quadratic equation 

XT - c?x A = OI^J{x A ~ X T ) 2 + Vt (12) 

which is given by ®. 


□ 


In general, it can be seen from Figure [2] that if xt < 0 then a = 0 as well. We will assume 
a < a < 1 , so that a solution to the active target defense differential game exists; otherwise, if 
a < a, the Defender will not be able to help the Target by intercepting the Attacker before the 
latter inevitably captures the Target; and if a > 1 then the Target can always evade the Attacker 
and there is no need for a Defender. 


C. Optimal Strategies 

When the Target is on the side of the Attacker, the Target chooses its aimpoint, denoted by 
I, on the orthogonal bisector of AD in order to maximize its payoff function (J7]), the final 
separation between Target and Attacker, and where y represents the coordinate of the aimpoint 
/ on the orthogonal bisector of AD. This is so because the Attacker will aim at the point I. In 
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order to minimize ([7]) the optimal strategy of the Attacker is to choose the same aimpoint / on 


the orthogonal bisector of AD, where it will be intercepted by the Defender. 

In order to find the maximum of © we differentiate eq. © in y and set the resulting derivative 
equal to zero 

dJ(y ) ay y-yr 


= 0 . 


(13) 


dy \/x 2 A + y 2 y/(y - y T ) 2 + x% 

The following quartic equation in y > 0 is obtained 

(1 - a 2 )y 4 - 2(1 - a 2 )y T y 3 + ((l-a 2 )y R +x 2 A -a 2 x 2 T )y 2 - 2 x\y T y + x\y% = 0. (14) 

In the sequel we focus on the case 0 < a < 1. In addition and without loss of generality 
assume that y T > 0. Let us divide both sides of eq. (fl4l) by y^ and set x A = x T = and 
y = -jj^, whereupon the quartic equation (fl4l) assumes the canonical form 

(1 — a 2 )y 4 — 2(1 — a 2 )y 3 + (l — a 2 + x 2 A — a 2 x^ r )y 2 — 2 x A y + x 2 A = 0. (15) 

We are interested in the real and positive solutions y > 0 of the canonical quartic equation (fl5l) . 
Eq. (IT5l) has two real solutions, 

0 < 2 /zz! < 1 and y R2 > 1 . 

When xt = 0, (fl5T) has two repeated solutions at y — 1 and two complex solutions y = 


dzi 


Vl— a ' 2 


X A . 


Remark. Writing the quartic equation (fl4l) as f(y) = 0 we see that /(0) = x\y\ > 0, 
f{yr) = —a 2 x R y R < 0, and /(oo) = +oo. Therefore, equation (fl4l) has two real solutions. 
Equation (fl4l) has a real solution 0 < y < y T and an additional real solution y T < y, provided 
that xt 7 ^ 0. Note that the quartic equation (fl4l) is parameterized by x R , so whether x R > 0 or 
xt < 0 makes no difference as far as the solutions to the quartic equation (fl4l) are concerned. 
However, if xt < 0 the applicable real solution is y < yT, whereas if > 0 the applicable real 
solution is y > y T - 

When xt > 0, by choosing his heading, the Target (and the Defender) thus choose the 
coordinate y to maximize J{y)', that is, y is the Target’s (and Defender’s) choice. Then the 
payoff is given by eq. © and the expression for was shown in (fl3l) . The second derivative 
of the payoff function 


d 2 J(y) 

dy 2 


ax\ 


Jyrjn 


{x\+y 2 ) 3/ 2 ((y-y T ) 2 +4) 3/2 ' 


(16) 
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The Target is choosing y to maximize the cost J(y). Now, the Attacker reacts by heading 
towards the point / on the orthogonal bisector of AD where, invariably, he will be intercepted 
by the Defender. Both the Target and the Attacker know that the three points T, I, T' must be 
collinear. The defender will not allow the Attacker to cross the orthogonal bisector because then 
the Attacker will start to close in on the Target. 

The optimal coordinate y* is the solution of the quartic equation (fl4l) such that the second- 
order condition for a maximum holds on '' < 0. In view of (fl3l) we know that 


1 y 1 

— 7 = OL - 

y/(y - Vt ) 2 + X 2 T V~Vt yjx 2 A + y 2 

and inserting (fT71) into (fT6l) yields 

d 2 J{y ) a 

dy 2 (x\ + y 2 ) 3 / 2 


x 2 a 


a 


y 


y-yr 


Xrj 


(17) 


(18) 


We have that '' J,y l! < 0 if and only if 


J_(%a y 

Xt* 


cr 


< 


y 


y-yr 


(19) 


Hence, the first real solution y\ < yr of the quartic equation (fT4l) does not fulfill the role of 
yielding a maximum and the second real solution y 2 > i/t of (fl4l) is the candidate solution. It is 
the Target who chooses y* to maximize the payoff J(y). Note that y\ = yry yr and y 2 = yn 2 J/r, 
where y Rl and y Ri are the real solutions of (fl5l) . 

Inserting eq. (ITTI) into eq. ([7]) yields the Target and Defender payoff 

J*(y) = ols/x\ + y 2 - + y 2 

= iV x A + y 2 {f ~ ^ _ 

and using y = y 2 


( 20 ) 


■T(y) = iV%+vK% -(l-a 2 )). (21) 

When a > a we have that J*(y) > 0. Hence, the solution y 2 > y T of the quartic equation (fl4l) 
must satisfy 


Vt < 2/2 < (22) 

This situation is illustrated in Figure [5] where the three points T, I, and T' are collinear. 
Concerning expression (fl9l) . we also need the solution of the quartic equation (fT4l) to satisfy 


1 

2/2 < 1 _ q ,2/3('£t^2/3 2/T - 


(23) 
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Fig. 5. Optimal Play when xt > 0 


The second real solution y 2 of the quartic equation (fl4l) must satisfy 


Vt <V 2 < min 


jVt, 


i - “ 2 i - “ 2 / 3 (S ) 2/3 


Vt 


(24) 


The points of intersection of the Apollonius circle with the y -axis (the orthogonal bisector) are 
(0, 2 /) and (0, y), where (0,7/) and (0 ,y) are the solutions of the quadratic equation 


x 2 0 + (y - yoY = 


a 


: d 2 


(25) 


(1 — a 2 ) 2 

where the distance d = \J(xa ~ xt) 2 + y\ and the Apollonius circle’s center coordinates are 
given by (fill) . We have that 

1 


(y - yoY = 


(1 -a 2 y 


■ ( a 2 d 2 — (xt — (x 2 xaY) 


where 


a 2 d 2 — (x T — 0 ( 2 xaY > 0 


because a > a and, from (fT2l) . we have that ad = x T — a 2 XA■ Hence, 

1 


y = yo± 


1 — a‘ 


■\Ja 2 d? — (xt — oPxa) 2 
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which results in 


1 ( 

\J a 2 y j, + (1 — a 2 )(a 2 x A — Xj) j 

( 26 ) 

l - 1 _ Q 2 (W- 


J oi 2 y\ + (1 — a 2 )(a 2 x A — x^)^ ■ 

( 27 ) 


The Target’s choice of the optimal y*, namely, the solution y 2 of the quartic equation (fT4l) must 
satisfy the inequalities 

1 * a 2 (vt~ \Ja 2 y^ + (l-a 2 )(a 2 x 2 A - x^ < y 2 

< ('vt + \! a2 y 2 T + (1 -a 2 )(a 2 x 2 A - xfo) • ( 28 ) 

Proposition 3: (Saddle point equilibrium). Consider the case xt > 0. The strategy y* of the 
Target, where y* is the real solution of the quartic equation (fl4l) which maximizes ©, and the 
strategy of the Defender of heading to the point (0, y*), together with the strategy of the Attacker 
of aiming at the point (0,y*), constitute a strategic saddle point, that is 

{J(u*,v*,w),J(u*,v,w*),J(u*,v,w)} < J(u*,v*,w*) < J(u,v*,w*). 

IV. ESCAPE REGION 

In this section we analyze the Target’s escape region for given Target and Attacker speeds, 
V T and Va, respectively. In other words, for given speed ratio a = V t /Va■ Consider the active 
target defense differential game where the Attacker and the Defender missiles have the same 
speeds. When xt > 0, the critical value of the speed ratio parameter a can be obtained as a 
function of the Attacker’s and the Target’s coordinates xa, %t, and ijr such that the Target is 
guaranteed to escape since the Defender will be able to intercept the Attacker before the latter 
reaches the Target. 

Now, for a given Target’s speed ratio, a, and for given Attacker’s initial position, xa, we 
wish to characterize the region of the reduced state space for which the Target is guaranteed to 
escape. In other words, we want to separate the reduced state space into two regions: R e and 
R eo . The region R, is defined as the set of all coordinate pairs (x,y) such that if the Target’s 
initial position (xt, Vt) is inside this region, then, it is guaranteed to escape the Attacker if both 
the Target and the Defender implement their corresponding optimal strategies. The region R So , 
represents all other coordinate pairs (x, y) in the reduced state space where the Target’s escape 
is not guaranteed. 
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Proposition 4: For given speed ratio a and for given Attacker’s initial position x A in the 
reduced state space, the curve that divides the reduced state space into the two regions R e and 
R eo is characterized by the right branch of the following hyperbola (that is, x > 0) 


x 


y 


= l. 


(29) 


a 2 x\ (1 — a) 2 x 2 A 

Proof. The requirement for the Target to escape being captured by the Attacker is that the 
Apollonius circle intersects the Y-axis. The radius of the Apollonius circle is 


r = 


* J{x A - %t ) 2 + Vt 
— a- v 


and the X-coordinate of its center is 

Xq = 


-{x T - a 2 x A ) 


(30) 


(31) 


1 — a 2 

If xt > o 2 x a , we need r > xo for the Defender to be of any help to the Target. Thus, x A > 0, 
y T > 0, and > 0 must satisfy the condition 

1 

1 — a' 2 v 1 — a- 

equivalently, 


- ~(x T - a 2 x A ) < a 9 \/(x a - x T ) 2 + y\ 

— cv z \ — v 


(32) 


xt — a 2 x A < a^l ( x A — xt ) 2 + y\ 


which is also equivalent to 


■r 2 

X A 


+ 


Vt 


> 1. 


(33) 


(34) 


(i ?) 2 1 (^x T y 

When the ‘greater than’ sign in inequality (l34l) is changed to ‘equal’ sign, the resulting equation 


defines the curve that divides the reduced state space into regions R e and R Co . Additionally, 
since the symmetric case yr < 0 can be treated in a similar way as the case yr > 0, we do not 
need to restrict y T to be greater than or equal to zero. Thus, the coordinate pairs [x. y) such that 


x~ 


+ 


y~ 




= 1 


can be written in the hyperbola canonical form shown in 


□ 

Remark. Note that for a given speed ratio a, the family of hyperbolas characterized by different 
values of x A > 0 shares the same center which is located at C — (0, 0), and the same asymptotes 
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Fig. 6. Attacker optimal strategy when J(y*) < 0 


which are given by the lines y = and y = — x. One can also see that, for 0 < a < 1, 

the slope of the asymptotes increases as a decreases and viceversa. This behavior is expected 
since a relatively faster Target will be able to escape the Attacker when starting at the same 
position as a relatively slower Target. 

It is important to emphasize that if ( xt , !Jt) £ Re a then capture of the Target by the Attacker 
is guaranteed if the Attacker employs its optimal strategy. In this case the optimal strategies 
described in Section ITlI-CI will result in J(y*) < 0. J(y*) being negative makes sense in terms 
of the differential game formulated in this paper (recall that the Attacker tries to minimize J(y)). 
However, the cost/payoff function J(y) represents a distance and it does not make sense for it to 
be negative in the real scenario where the Attacker tries to capture the Target, i.e. the terminal 
separation AT should be zero instead of negative. Based on the solution of the differential game 
presented in Section IIII-C1 the Attacker is able to redefine its strategy and capture the Target, that 
is, to obtain AT = 0. The new strategy is as follows. The Attacker, by solving the differential 
game and obtaining the optimal cost/payoff, reali z es that J(y*) < 0, then, it simply redefines its 
optimal strategy to be u'*(v') = v'. 

The details when the optimal strategies of Section ITlI-CI result in ■f(y*) < 0 are as follows. 
T chooses his aimpoint to be v' that lies on the Apollonius circle. A reali z es that J(y*) < 0 
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Fig. 7. Example 1: Escape region 


(equivalently, the Apollonius circle does not intersect the Y-axis) and chooses his aimpoint u' 
also on the Apollonius circle - see Figure [6) Given T’s choice of v', the soonest A can make 
AT = 0 is by capturing T on the Apollonius circle (otherwise the Target will exit the Apollonius 
circle and the Defender may be able to assist the Target). Thus, u'*(v') = v'. 

Similarly, T solves the differential game and obtains J(y*) < 0. This information is useful 
to T and it realizes that D is unable to intercept A. Thus, T will be prepared to apply passive 
countermeasures such as releasing chaff and flares. T can also change its objective and find 
some u'* in order to optimize a different criterion such as to maximize capture time; however, 
this topic falls outside the scope of this paper. 

V. EXAMPLES 

Example 1. Consider the speed ratio a = 0.5 and the Attacker’s initial position xa = 6. The 
right branch hyperbola shown in Figure [7] divides the Target escape/capture regions. 

Simulation: Let the Target’s initial coordinates be xt = 3 and y T = 2. Note that (xt, Vt) G R e - 
The Y-coordinate of the optimal interception point is given by y* = 2.6108. Figure [8] shows the 
results of the simulation. The optimal cost/payoff is J(y*) = 0.2102 and the Target escapes 
being captured by the Attacker. 

Example 2. For a given speed ratio cc, we can plot a family of right hand hyperbolas on the 
same plane for different values of xa- Consider a = 0.7. Figure [9] shows several hyperbolas (l29l) 
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Fig. 8. Example 1: Simulation 



X 


Fig. 9. Example 2: Family of right branch hyperbolas 


for values of xa = 1,2,8. 


VI. CONCLUSIONS 

A cooperative missile problem involving three agents, the Target, the Attacker, and the De¬ 
fender was studied in this paper. A differential game was analyzed where the Target and the 
Defender team up against the Attacker. The Attacker tries to pursue and capture the Target. The 
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Target tries to evade the Attacker and the Defender helps the Target to evade by intercepting 
the Attacker before the latter reaches the Target. This paper provided optimal strategies for each 
one of the agents and also provided a further analysis of the Target escape regions for a given 
Target/Attacker speed ratio. 


References 

[1] S. A. Ganebny, S. S. Kumkov, S. Le Menec. and V. S. Patsko, “Model problem in a line with two pursuers and one 
evader,” Dynamic Games and Applications , vol. 2, no. 2, pp. 228-257, 2012. 

[2] H. Huang, W. Zhang, J. Ding, D. M. Stipanovic, and C. J. Tomlin, “Guaranteed decentralized pursuit-evasion in the plane 
with multiple pursuers,” in 50th IEEE Conference on Decision and Control and European Control Conference , 2011, pp. 
4835-4840. 

[3] K. Pham, “Risk-averse based paradigms for uncertainty forecast and management in differential games of persistent 
disruptions and denials,” in American Control Conference , 2010, pp. 842-849. 

[4] E. Bakolas and P. Tsiotras, “Optimal pursuit of moving targets using dynamic Voronoi diagrams,” in 49th IEEE Conference 
on Decision and Control, 2010, pp. 7431-7436. 

[5] J. Sprinkle, J. M. Eklund, H. J. Kim, and S. Sastry, “Encoding aerial pursuit/evasion games with fixed wing aircraft into a 
nonlinear model predictive tracking controller” in 43rd IEEE Conference on Decision and Control, 2004, pp. 2609-2614. 

[6] M. G. Earl and R. DAndrea, "A decomposition approach to multi-vehicle cooperative control,” Robotics and Autonomous 
Systems, vol. 55, no. 4, pp. 276-291, 2007. 

[7] Z. E. Fuchs, P. P. Khargonekar, and J. Evers, “Cooperative defense within a single-pursuer, two-evader pursuit evasion 
differential game,” in 49th IEEE Conference on Decision and Control, 2010, pp. 3091-3097. 

[8] W. Scott and N. E. Leonard, “Pursuit, herding and evasion: A three-agent model of caribou predation ” in American Control 
Conference, 2013, pp. 2978-2983. 

[9] R. L. Boyell, “Defending a moving target against missile or torpedo attack,” IEEE Transactions on Aerospace and Electronic 
Systems, vol. AES-12, no. 4, pp. 522-526, 1976. 

[10] -, “Counterweapon aiming for defence of a moving target,” IEEE Transactions on Aerospace and Electronic Systems, 

vol. AES-16, no. 3, pp. 402-408, 1980. 

[11] P. Zarchan, Tactical and strategic missile guidance. AIAA Progress in Aeronautics and Astronautics, Reston, VA, 1997, 
vol. 176. 

[12] G. Siouris, Missile guidance and control systems. New York, Springer, 2004. 

[13] A. Ratnoo and T. Shima, “Line-of-sight interceptor guidance for defending an aircraft,” Journal of Guidance, Control, and 
Dynamics, vol. 34, no. 2, pp. 522-532, 2011. 

[14] S. Rubinsky and S. Gutman, “Three-player pursuit and evasion conflict,” Journal of Guidance, Control, and Dynamics, 
vol. 37, no. 1, pp. 98-110, 2014. 

[15] T. Yamasaki and S. N. Balakrishnan, “Triangle intercept guidance for aerial defense,” in AIAA Guidance, Navigation, and 
Control Conference. American Institute of Aeronautics and Astronautics, 2010. 

[16] T. Yamasaki, S. N. Balakrishnan, and H. Takano, “Modified command to line-of-sight intercept guidance for aircraft 
defense,” Journal of Guidance, Control, and Dynamics, vol. 36. no. 3, pp. 898-902, 2013. 

[17] A. Perelman, T. Shima, and I. Rusnak, “Cooperative differential games strategies for active aircraft protection from a 
homing missile,” Journal of Guidance, Control, and Dynamics, vol. 34, no. 3, pp. 761-773, 2011. 


18 



[18] I. Rusnak, “The lady, the bandits, and the bodyguards-a two team dynamic game,” in Proceedings of the 16th World IFAC 
Congress , 2005. 

[19] I. Rusnak, H. Weiss, and G. Hexner, “Guidance laws in target-missile-defender scenario with an aggressive defender,” in 
Proceedings of the 18th IFAC World Congress, 2011, pp. 9349-9354. 

[20] A. Ratnoo and T. Shima, “Guidance strategies against defended aerial targets,” Journal of Guidance, Control, and Dynamics, 
vol. 35, no. 4, pp. 1059-1068, 2012. 

[21] T. Shima, “Optimal cooperative pursuit and evasion strategies against a homing missile,” Journal of Guidance, Control, 
and Dynamics, vol. 34, no. 2, pp. 414-425, 2011. 

[22] V. Shaferman and T. Shima, “Cooperative multiple-model adaptive guidance for an aircraft defending missile,” Journal of 
Guidance, Control, and Dynamics, vol. 33, no. 6. pp. 1801-1813, 2010. 

[23] O. Prokopov and T. Shima, “Linear quadratic optimal cooperative strategies for active aircraft protection,” Journal of 
Guidance, Control, and Dynamics, vol. 36, no. 3, pp. 753-764, 2013. 

[24] E. Garcia, D. W. Casbeer, K. Pham, and M. Pachter, “Cooperative aircraft defense from an attacking missile,” in 53rd 
IEEE Conference on Decision and Control, 2014, pp. 2926-2931. 

[25] E. Garcia, D. W. Casbeer, K. Pham, and M. Pachter, “Cooperative aircraft defense from an attacking missile using 
proportional navigation,” in 2015 AIAA Guidence, Navigation, and Control Conference, 2015. 

[26] M. Pachter, E. Garcia, and D. W. Casbeer, “Active target defense differential game,” in 52nd Annual Allerton Conference 
on Communication, Control, and Computing, 2014, pp. 46-53. 

[27] E. Garcia, D. W. Casbeer, and M. Pachter, “Active target defense differential game with a fast defender,” in American 
Control Conference, 2015. 

[28] E. Garcia, D. W. Casbeer, and M. Pachter, “Cooperative strategies for optimal aircraft defense from an attacking missile,” 
AIAA Journal of Guidance, Control, and Dynamics, DOI: 10.2514/1.G001083, 2015. 

[29] R. Isaacs, Differential Games, New York: Wiley, 1965. 


19 



